Job Description:This is a Big Data Administrator Lead position, not a developer position, and it is a hands-on role. Please do not apply if you have not set up Cloudera Solr, HBase, or led a Cloudera Admin team.
- The Lead Data Engineer is responsible for orchestrating, deploying, maintaining and scaling cloud infrastructure targeting big data and platform data management (e.g., data warehouses, data lakes) including data access APIs.
- Prepares and manipulates data using Hadoop or equivalent.) with emphasis on high availability, reliability, automation and performance.
- This role will focus on leading the automation and components (Solr, HBase) set up and improvement for Cloudera CDP public cloud and integrate with other AWS services.
- Advanced (expert preferred) level experience in administrating and engineering relational databases (ex. MySQL, PostgreSQL), Big Data systems (ex. Cloudera Data Platform Private Cloud and Public Cloud), Apache Solr as SME, automation tools (ex. Ansible, Terraform, Bit Bucket) and experience working cloud solutions (specifically data products on AWS) are necessary.
- At least 10 years of Experienced with all the tasks involved in administration of big data and Meta Data Hub such as Cloudera. Cloudera Solr and HBase experience is a MUST.
- Experience with Ab Initio, EMR, S3, Dynamo DB, Mongo DB, ProgreSQL, RDS, DB2 is a Plus.
- DevOps (CI/CD Pipeline) is a Plus.
- Experience with Advance knowledge of UNIX and SQL
- Experience with manage metadata hub-MDH, Operational Console and troubleshoot environmental issues which affect these components
- Represents team in all architectural and design discussions. Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas. Require tracking and monitoring projects and tasks as the lead as well as taking on hands on work.